"សទ្ទ" meaning in All languages combined

Noun [Khmer]

Forms: sattĕəʼ [romanization]
Etymology: From Pali sadda, from Sanskrit शब्द (śabda). Doublet of សព្ទ (sap).
Etymology templates: {{bor|km|pi|sadda}} Pali sadda, {{der|km|sa|शब्द}} Sanskrit शब्द (śabda), {{doublet|km|សព្ទ}} Doublet of សព្ទ (sap)
Head templates: {{km-noun}} សទ្ទ • (sattĕəʼ)
  1. sound, voice, phono-
    Sense id: en-សទ្ទ-km-noun-Axa1XxqL
  2. word, language
    Sense id: en-សទ្ទ-km-noun-pXOppPvV
    Categories (other): Khmer entries with incorrect language header, Khmer terms with non-redundant manual transliterations, Khmer terms with redundant script codes
    Disambiguation of Khmer entries with incorrect language header: 24 76
    Disambiguation of Khmer terms with non-redundant manual transliterations: 19 81
    Disambiguation of Khmer terms with redundant script codes: 17 83
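The disambiguation figures (for example 24 76) appear in the JSON as a space-separated `_dis` string on each category. The sketch below assumes, without confirmation from this page, that the numbers are per-sense weights listed in sense order; `best_sense` is a hypothetical helper name, not part of wiktextract.

```python
def best_sense(dis):
    """Return (sense_index, weight) for the largest weight in a "_dis" string.

    Assumption: the string holds one integer weight per sense, in the same
    order as the entry's "senses" array. The source page does not state this.
    """
    weights = [int(w) for w in dis.split()]
    # Index of the largest weight; ties resolve to the earliest sense.
    best = max(range(len(weights)), key=weights.__getitem__)
    return best, weights[best]


# "_dis" values copied from the three categories listed above.
for dis in ("24 76", "19 81", "17 83"):
    index, weight = best_sense(dis)
    print(f"{dis!r}: sense {index + 1} (weight {weight})")
```

Under that assumption, all three categories lean toward the second sense, which matches where the rendered entry lists them.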

Noun [Pali]

Head templates: {{pi-noun|m}} សទ្ទ m
Inflection templates: {{pi-decl-noun|g=m}}
Forms: masculine [table-tags],
  សទ្ទោ [nominative, singular], សទ្ទា [nominative, plural],
  សទ្ទំ [accusative, singular], សទ្ទេ [accusative, plural],
  សទ្ទេន [instrumental, singular], សទ្ទេហិ [instrumental, plural], សទ្ទេភិ [instrumental, plural],
  សទ្ទស្ស [dative, singular], សទ្ទាយ [dative, singular], សទ្ទត្ថំ [dative, singular], សទ្ទានំ [dative, plural],
  សទ្ទស្មា [ablative, singular], សទ្ទម្ហា [ablative, singular], សទ្ទា [ablative, singular], សទ្ទេហិ [ablative, plural], សទ្ទេភិ [ablative, plural],
  សទ្ទស្ស [genitive, singular], សទ្ទានំ [genitive, plural],
  សទ្ទស្មិំ [locative, singular], សទ្ទម្ហិ [locative, singular], សទ្ទេ [locative, singular], សទ្ទេសុ [locative, plural],
  សទ្ទ [singular, vocative], សទ្ទា [plural, vocative]
  1. Khmer script form of sadda (“sound”)
    Tags: Khmer, character, form-of, masculine
    Form of: sadda (extra: sound)
    Sense id: en-សទ្ទ-pi-noun-S50R3uW-
    Categories (other): Khmer script, Pali entries with incorrect language header
    Synonyms: sadda [Latin, character], 𑀲𑀤𑁆𑀤 [Brahmi, character], सद्द [Devanagari, character], সদ্দ [Bengali], සද්ද [Sinhalese, character], သဒ္ဒ, သၻ္ၻ, သၻ်ၻ [Burmese, character], สทฺท, สัททะ [Thai, character], ᩈᨴ᩠ᨴ [Tai-Tham, character], ສທ຺ທ, ສັທທະ [Lao, character], 𑄥𑄘𑄴𑄘 [Chakma, character]

Download JSON data for សទ្ទ meaning in All languages combined (5.8kB)

{
  "etymology_templates": [
    {
      "args": {
        "1": "km",
        "2": "pi",
        "3": "sadda"
      },
      "expansion": "Pali sadda",
      "name": "bor"
    },
    {
      "args": {
        "1": "km",
        "2": "sa",
        "3": "शब्द"
      },
      "expansion": "Sanskrit शब्द (śabda)",
      "name": "der"
    },
    {
      "args": {
        "1": "km",
        "2": "សព្ទ"
      },
      "expansion": "Doublet of សព្ទ (sap)",
      "name": "doublet"
    }
  ],
  "etymology_text": "From Pali sadda, from Sanskrit शब्द (śabda). Doublet of សព្ទ (sap).",
  "forms": [
    {
      "form": "sattĕəʼ",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "សទ្ទ • (sattĕəʼ)",
      "name": "km-noun"
    }
  ],
  "lang": "Khmer",
  "lang_code": "km",
  "pos": "noun",
  "senses": [
    {
      "categories": [],
      "examples": [
        {
          "text": "សទ្ទតា ― sattĕəʼtaa ― phoneme",
          "type": "example"
        }
      ],
      "glosses": [
        "sound, voice, phono-"
      ],
      "id": "en-សទ្ទ-km-noun-Axa1XxqL",
      "links": [
        [
          "sound",
          "sound"
        ],
        [
          "voice",
          "voice"
        ],
        [
          "phono-",
          "phono-"
        ]
      ]
    },
    {
      "categories": [
        {
          "_dis": "24 76",
          "kind": "other",
          "name": "Khmer entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "19 81",
          "kind": "other",
          "name": "Khmer terms with non-redundant manual transliterations",
          "parents": [
            "Terms with non-redundant manual transliterations",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        },
        {
          "_dis": "17 83",
          "kind": "other",
          "name": "Khmer terms with redundant script codes",
          "parents": [
            "Terms with redundant script codes",
            "Entry maintenance"
          ],
          "source": "w+disamb"
        }
      ],
      "glosses": [
        "word, language"
      ],
      "id": "en-សទ្ទ-km-noun-pXOppPvV",
      "links": [
        [
          "word",
          "word"
        ],
        [
          "language",
          "language"
        ]
      ]
    }
  ],
  "word": "សទ្ទ"
}
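As a rough illustration of how the Khmer record above might be consumed, the following sketch reads a trimmed copy of it and pulls out the glosses and the source-language codes of the `bor` and `der` etymology templates. The `summarize` helper and the `ENTRY_JSON` constant are hypothetical names introduced here; only fields visible in the record are used.

```python
import json

# Trimmed copy of the Khmer entry above; only the fields used here are kept.
ENTRY_JSON = """
{
  "word": "សទ្ទ",
  "lang": "Khmer",
  "pos": "noun",
  "etymology_templates": [
    {"name": "bor", "args": {"1": "km", "2": "pi", "3": "sadda"}},
    {"name": "der", "args": {"1": "km", "2": "sa", "3": "शब्द"}},
    {"name": "doublet", "args": {"1": "km", "2": "សព្ទ"}}
  ],
  "senses": [
    {"glosses": ["sound, voice, phono-"]},
    {"glosses": ["word, language"]}
  ]
}
"""


def summarize(entry):
    """Return (glosses, source language codes) for one entry dict."""
    glosses = [
        gloss
        for sense in entry.get("senses", [])
        for gloss in sense.get("glosses", [])
    ]
    # In bor/der templates, argument "2" is the source language code
    # ("pi" for Pali, "sa" for Sanskrit in this entry).
    sources = [
        template["args"]["2"]
        for template in entry.get("etymology_templates", [])
        if template["name"] in ("bor", "der")
    ]
    return glosses, sources


entry = json.loads(ENTRY_JSON)
glosses, sources = summarize(entry)
print(entry["word"], entry["lang"], entry["pos"])
print(glosses)
print(sources)
```

The `doublet` template is skipped on purpose: its second argument is a related term, not a language code.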

{
  "forms": [
    {
      "form": "masculine",
      "source": "declension",
      "tags": [
        "table-tags"
      ]
    },
    {
      "form": "pi-decl-noun",
      "source": "declension",
      "tags": [
        "inflection-template"
      ]
    },
    {
      "form": "សទ្ទោ",
      "roman": "saddo",
      "source": "declension",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទា",
      "roman": "saddā",
      "source": "declension",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទំ",
      "roman": "saddaṃ",
      "source": "declension",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេ",
      "roman": "sadde",
      "source": "declension",
      "tags": [
        "accusative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទេន",
      "roman": "saddena",
      "source": "declension",
      "tags": [
        "instrumental",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេហិ",
      "roman": "saddehi",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    },
    {
      "form": "សទ្ទេភិ",
      "roman": "saddebhi",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    },
    {
      "form": "សទ្ទស្ស",
      "roman": "saddassa",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទាយ",
      "roman": "saddāya",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទត្ថំ",
      "roman": "saddatthaṃ",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទានំ",
      "roman": "saddānaṃ",
      "source": "declension",
      "tags": [
        "dative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទស្មា",
      "roman": "saddasmā",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទម្ហា",
      "roman": "saddamhā",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទា",
      "roman": "saddā",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេហិ",
      "roman": "saddehi",
      "source": "declension",
      "tags": [
        "ablative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទេភិ",
      "roman": "saddebhi",
      "source": "declension",
      "tags": [
        "ablative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទស្ស",
      "roman": "saddassa",
      "source": "declension",
      "tags": [
        "genitive",
        "singular"
      ]
    },
    {
      "form": "សទ្ទានំ",
      "roman": "saddānaṃ",
      "source": "declension",
      "tags": [
        "genitive",
        "plural"
      ]
    },
    {
      "form": "សទ្ទស្មិំ",
      "roman": "saddasmiṃ",
      "source": "declension",
      "tags": [
        "locative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទម្ហិ",
      "roman": "saddamhi",
      "source": "declension",
      "tags": [
        "locative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេ",
      "roman": "sadde",
      "source": "declension",
      "tags": [
        "locative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេសុ",
      "roman": "saddesu",
      "source": "declension",
      "tags": [
        "locative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទ",
      "roman": "sadda",
      "source": "declension",
      "tags": [
        "singular",
        "vocative"
      ]
    },
    {
      "form": "សទ្ទា",
      "roman": "saddā",
      "source": "declension",
      "tags": [
        "plural",
        "vocative"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "m"
      },
      "expansion": "សទ្ទ m",
      "name": "pi-noun"
    }
  ],
  "inflection_templates": [
    {
      "args": {
        "g": "m"
      },
      "name": "pi-decl-noun"
    }
  ],
  "lang": "Pali",
  "lang_code": "pi",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Khmer script",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pali entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        }
      ],
      "form_of": [
        {
          "extra": "sound",
          "word": "sadda"
        }
      ],
      "glosses": [
        "Khmer script form of sadda (“sound”)"
      ],
      "id": "en-សទ្ទ-pi-noun-S50R3uW-",
      "links": [
        [
          "sadda",
          "sadda#Pali"
        ]
      ],
      "synonyms": [
        {
          "tags": [
            "Latin",
            "character"
          ],
          "word": "sadda"
        },
        {
          "tags": [
            "Brahmi",
            "character"
          ],
          "word": "𑀲𑀤𑁆𑀤"
        },
        {
          "tags": [
            "Devanagari",
            "character"
          ],
          "word": "सद्द"
        },
        {
          "tags": [
            "Bengali"
          ],
          "word": "সদ্দ"
        },
        {
          "tags": [
            "Sinhalese",
            "character"
          ],
          "word": "සද්ද"
        },
        {
          "word": "သဒ္ဒ"
        },
        {
          "word": "သၻ္ၻ"
        },
        {
          "tags": [
            "Burmese",
            "character"
          ],
          "word": "သၻ်ၻ"
        },
        {
          "word": "สทฺท"
        },
        {
          "tags": [
            "Thai",
            "character"
          ],
          "word": "สัททะ"
        },
        {
          "tags": [
            "Tai-Tham",
            "character"
          ],
          "word": "ᩈᨴ᩠ᨴ"
        },
        {
          "word": "ສທ຺ທ"
        },
        {
          "tags": [
            "Lao",
            "character"
          ],
          "word": "ສັທທະ"
        },
        {
          "tags": [
            "Chakma",
            "character"
          ],
          "word": "𑄥𑄘𑄴𑄘"
        }
      ],
      "tags": [
        "Khmer",
        "character",
        "form-of",
        "masculine"
      ]
    }
  ],
  "word": "សទ្ទ"
}
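The flat `forms` array in the Pali record above can be regrouped into a case-by-number declension table. The sketch below is one possible way to do that, using a few rows copied from the record; `declension_table`, `META_TAGS`, and `NUMBER_TAGS` are hypothetical names, and the code assumes every real form carries exactly one case tag and one number tag, as the rows shown here do.

```python
from collections import defaultdict

# A few rows copied from the "forms" array above.
FORMS = [
    {"form": "masculine", "source": "declension", "tags": ["table-tags"]},
    {"form": "pi-decl-noun", "source": "declension", "tags": ["inflection-template"]},
    {"form": "សទ្ទោ", "roman": "saddo", "tags": ["nominative", "singular"]},
    {"form": "សទ្ទា", "roman": "saddā", "tags": ["nominative", "plural"]},
    {"form": "សទ្ទេន", "roman": "saddena", "tags": ["instrumental", "singular"]},
    {"form": "សទ្ទេហិ", "roman": "saddehi", "tags": ["instrumental", "plural"]},
    {"form": "សទ្ទេភិ", "roman": "saddebhi", "tags": ["instrumental", "plural"]},
]

META_TAGS = {"table-tags", "inflection-template"}  # rows that are not word forms
NUMBER_TAGS = {"singular", "plural"}


def declension_table(forms):
    """Group forms as table[case][number] -> list of romanized forms."""
    table = defaultdict(lambda: defaultdict(list))
    for row in forms:
        tags = set(row["tags"])
        if tags & META_TAGS:
            continue  # skip the table-tags and inflection-template rows
        number = next(iter(tags & NUMBER_TAGS), None)
        case = next(iter(tags - NUMBER_TAGS), None)
        if number and case:
            table[case][number].append(row.get("roman", row["form"]))
    return table


table = declension_table(FORMS)
for case, by_number in table.items():
    print(case, dict(by_number))
```

Using sets for the tags keeps the grouping independent of tag order, which matters because the vocative rows list the number before the case.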
{
  "categories": [
    "Khmer doublets",
    "Khmer entries with incorrect language header",
    "Khmer lemmas",
    "Khmer nouns",
    "Khmer terms borrowed from Pali",
    "Khmer terms derived from Pali",
    "Khmer terms derived from Sanskrit",
    "Khmer terms with IPA pronunciation",
    "Khmer terms with non-redundant manual transliterations",
    "Khmer terms with redundant script codes"
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "km",
        "2": "pi",
        "3": "sadda"
      },
      "expansion": "Pali sadda",
      "name": "bor"
    },
    {
      "args": {
        "1": "km",
        "2": "sa",
        "3": "शब्द"
      },
      "expansion": "Sanskrit शब्द (śabda)",
      "name": "der"
    },
    {
      "args": {
        "1": "km",
        "2": "សព្ទ"
      },
      "expansion": "Doublet of សព្ទ (sap)",
      "name": "doublet"
    }
  ],
  "etymology_text": "From Pali sadda, from Sanskrit शब्द (śabda). Doublet of សព្ទ (sap).",
  "forms": [
    {
      "form": "sattĕəʼ",
      "tags": [
        "romanization"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {},
      "expansion": "សទ្ទ • (sattĕəʼ)",
      "name": "km-noun"
    }
  ],
  "lang": "Khmer",
  "lang_code": "km",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "Khmer terms with usage examples"
      ],
      "examples": [
        {
          "text": "សទ្ទតា ― sattĕəʼtaa ― phoneme",
          "type": "example"
        }
      ],
      "glosses": [
        "sound, voice, phono-"
      ],
      "links": [
        [
          "sound",
          "sound"
        ],
        [
          "voice",
          "voice"
        ],
        [
          "phono-",
          "phono-"
        ]
      ]
    },
    {
      "glosses": [
        "word, language"
      ],
      "links": [
        [
          "word",
          "word"
        ],
        [
          "language",
          "language"
        ]
      ]
    }
  ],
  "word": "សទ្ទ"
}

{
  "forms": [
    {
      "form": "masculine",
      "source": "declension",
      "tags": [
        "table-tags"
      ]
    },
    {
      "form": "pi-decl-noun",
      "source": "declension",
      "tags": [
        "inflection-template"
      ]
    },
    {
      "form": "សទ្ទោ",
      "roman": "saddo",
      "source": "declension",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទា",
      "roman": "saddā",
      "source": "declension",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទំ",
      "roman": "saddaṃ",
      "source": "declension",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេ",
      "roman": "sadde",
      "source": "declension",
      "tags": [
        "accusative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទេន",
      "roman": "saddena",
      "source": "declension",
      "tags": [
        "instrumental",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេហិ",
      "roman": "saddehi",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    },
    {
      "form": "សទ្ទេភិ",
      "roman": "saddebhi",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    },
    {
      "form": "សទ្ទស្ស",
      "roman": "saddassa",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទាយ",
      "roman": "saddāya",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទត្ថំ",
      "roman": "saddatthaṃ",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទានំ",
      "roman": "saddānaṃ",
      "source": "declension",
      "tags": [
        "dative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទស្មា",
      "roman": "saddasmā",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទម្ហា",
      "roman": "saddamhā",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទា",
      "roman": "saddā",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេហិ",
      "roman": "saddehi",
      "source": "declension",
      "tags": [
        "ablative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទេភិ",
      "roman": "saddebhi",
      "source": "declension",
      "tags": [
        "ablative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទស្ស",
      "roman": "saddassa",
      "source": "declension",
      "tags": [
        "genitive",
        "singular"
      ]
    },
    {
      "form": "សទ្ទានំ",
      "roman": "saddānaṃ",
      "source": "declension",
      "tags": [
        "genitive",
        "plural"
      ]
    },
    {
      "form": "សទ្ទស្មិំ",
      "roman": "saddasmiṃ",
      "source": "declension",
      "tags": [
        "locative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទម្ហិ",
      "roman": "saddamhi",
      "source": "declension",
      "tags": [
        "locative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេ",
      "roman": "sadde",
      "source": "declension",
      "tags": [
        "locative",
        "singular"
      ]
    },
    {
      "form": "សទ្ទេសុ",
      "roman": "saddesu",
      "source": "declension",
      "tags": [
        "locative",
        "plural"
      ]
    },
    {
      "form": "សទ្ទ",
      "roman": "sadda",
      "source": "declension",
      "tags": [
        "singular",
        "vocative"
      ]
    },
    {
      "form": "សទ្ទា",
      "roman": "saddā",
      "source": "declension",
      "tags": [
        "plural",
        "vocative"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "m"
      },
      "expansion": "សទ្ទ m",
      "name": "pi-noun"
    }
  ],
  "inflection_templates": [
    {
      "args": {
        "g": "m"
      },
      "name": "pi-decl-noun"
    }
  ],
  "lang": "Pali",
  "lang_code": "pi",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "Khmer script",
        "Pali entries with incorrect language header",
        "Pali lemmas",
        "Pali masculine nouns",
        "Pali nouns",
        "Pali nouns in Khmer script"
      ],
      "form_of": [
        {
          "extra": "sound",
          "word": "sadda"
        }
      ],
      "glosses": [
        "Khmer script form of sadda (“sound”)"
      ],
      "links": [
        [
          "sadda",
          "sadda#Pali"
        ]
      ],
      "tags": [
        "Khmer",
        "character",
        "form-of",
        "masculine"
      ]
    }
  ],
  "synonyms": [
    {
      "tags": [
        "Latin",
        "character"
      ],
      "word": "sadda"
    },
    {
      "tags": [
        "Brahmi",
        "character"
      ],
      "word": "𑀲𑀤𑁆𑀤"
    },
    {
      "tags": [
        "Devanagari",
        "character"
      ],
      "word": "सद्द"
    },
    {
      "tags": [
        "Bengali"
      ],
      "word": "সদ্দ"
    },
    {
      "tags": [
        "Sinhalese",
        "character"
      ],
      "word": "සද්ද"
    },
    {
      "word": "သဒ္ဒ"
    },
    {
      "word": "သၻ္ၻ"
    },
    {
      "tags": [
        "Burmese",
        "character"
      ],
      "word": "သၻ်ၻ"
    },
    {
      "word": "สทฺท"
    },
    {
      "tags": [
        "Thai",
        "character"
      ],
      "word": "สัททะ"
    },
    {
      "tags": [
        "Tai-Tham",
        "character"
      ],
      "word": "ᩈᨴ᩠ᨴ"
    },
    {
      "word": "ສທ຺ທ"
    },
    {
      "tags": [
        "Lao",
        "character"
      ],
      "word": "ສັທທະ"
    },
    {
      "tags": [
        "Chakma",
        "character"
      ],
      "word": "𑄥𑄘𑄴𑄘"
    }
  ],
  "word": "សទ្ទ"
}
{
  "called_from": "inflection/2466",
  "msg": "accepted heuristic header: table cell identified as header and given candidate status, AND the cleaned text is in LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Nominative (first)",
  "path": [
    "សទ្ទ"
  ],
  "section": "Pali",
  "subsection": "noun",
  "title": "សទ្ទ",
  "trace": ""
}

{
  "called_from": "inflection/2466",
  "msg": "accepted heuristic header: table cell identified as header and given candidate status, AND the cleaned text is in LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Accusative (second)",
  "path": [
    "សទ្ទ"
  ],
  "section": "Pali",
  "subsection": "noun",
  "title": "សទ្ទ",
  "trace": ""
}

{
  "called_from": "inflection/2466",
  "msg": "accepted heuristic header: table cell identified as header and given candidate status, AND the cleaned text is in LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Instrumental (third)",
  "path": [
    "សទ្ទ"
  ],
  "section": "Pali",
  "subsection": "noun",
  "title": "សទ្ទ",
  "trace": ""
}

{
  "called_from": "inflection/2466",
  "msg": "accepted heuristic header: table cell identified as header and given candidate status, AND the cleaned text is in LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Dative (fourth)",
  "path": [
    "សទ្ទ"
  ],
  "section": "Pali",
  "subsection": "noun",
  "title": "សទ្ទ",
  "trace": ""
}

{
  "called_from": "inflection/2466",
  "msg": "accepted heuristic header: table cell identified as header and given candidate status, AND the cleaned text is in LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Ablative (fifth)",
  "path": [
    "សទ្ទ"
  ],
  "section": "Pali",
  "subsection": "noun",
  "title": "សទ្ទ",
  "trace": ""
}

{
  "called_from": "inflection/2466",
  "msg": "accepted heuristic header: table cell identified as header and given candidate status, AND the cleaned text is in LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Genitive (sixth)",
  "path": [
    "សទ្ទ"
  ],
  "section": "Pali",
  "subsection": "noun",
  "title": "សទ្ទ",
  "trace": ""
}

{
  "called_from": "inflection/2466",
  "msg": "accepted heuristic header: table cell identified as header and given candidate status, AND the cleaned text is in LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Locative (seventh)",
  "path": [
    "សទ្ទ"
  ],
  "section": "Pali",
  "subsection": "noun",
  "title": "សទ្ទ",
  "trace": ""
}

{
  "called_from": "inflection/2466",
  "msg": "accepted heuristic header: table cell identified as header and given candidate status, AND the cleaned text is in LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Vocative (calling)",
  "path": [
    "សទ្ទ"
  ],
  "section": "Pali",
  "subsection": "noun",
  "title": "សទ្ទ",
  "trace": ""
}
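The eight debug records above differ only in the header text at the end of `msg`. A minimal sketch for recovering those header cells is shown below, assuming the message always ends with "cleaned text: Case (label)" as it does in the records here; `header_cells` and `PATTERN` are hypothetical names.

```python
import re

# Two "msg" values copied from the debug records above.
MESSAGES = [
    "accepted heuristic header: table cell identified as header and given "
    "candidate status, AND the cleaned text is in "
    "LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Nominative (first)",
    "accepted heuristic header: table cell identified as header and given "
    "candidate status, AND the cleaned text is in "
    "LANGUAGES_WITH_CELLS_AS_HEADERS[Pali]; cleaned text: Vocative (calling)",
]

# Matches the trailing "cleaned text: <Case> (<label>)" part of a message.
PATTERN = re.compile(r"cleaned text: (?P<case>\w+) \((?P<label>[^)]+)\)$")


def header_cells(messages):
    """Return (case, label) pairs for messages that match the pattern."""
    cells = []
    for message in messages:
        match = PATTERN.search(message)
        if match:
            cells.append((match["case"], match["label"]))
    return cells


cells = header_cells(MESSAGES)
print(cells)
```

Messages that do not end in this shape are skipped without raising an error, so the helper can be run over mixed debug output.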

This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2024-06-04 from the enwiktionary dump dated 2024-05-02 using wiktextract (e9e0a99 and db5a844). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.